arXiv:cs/0112012v2 [cs.DM] 12 Dec 2001 


Computing the average parallelism in 
trace monoids* 

Daniel KrofcJ Jean Mairesse^ Ioannis Michos ' 
February 1, 2008 


Abstract 

The height of a trace is the height of the corresponding heap of 
pieces in Viennot’s representation, or equivalently the number of fac¬ 
tors in its Cartier-Foata decomposition. Let h(t) and |f| stand re¬ 
spectively for the height and the length of a trace t. We prove that 
the bivariate commutative series ]T\ x h ^y I* is rational, and we give a 
finite representation of it. We use the rationality to obtain precise in¬ 
formation on the asymptotics of the number of traces of a given height 
or length. Then, we study the average height of a trace for various 
probability distributions on traces. For the uniform probability distri¬ 
bution on traces of the same length (resp. of the same height), the 
asymptotic average height (resp. length) exists and is an algebraic 
number. To illustrate our results and methods, we consider a couple of 
examples: the free commutative monoid and the trace monoid whose 
independence graph is the ladder graph. 

Keywords: Automata and formal languages, trace monoids, Cartier- 
Foata normal form, height function, generating series, speedup, per¬ 
formance evaluation. 


1 Introduction 


Traces are used to model the occurrence of events in concurrent systems 12 ]. 
Roughly speaking, a letter corresponds to an event and two letters commute 
when the corresponding events can occur simultaneously. In this context, 
the two basic performance measures associated with a trace t are its length |t| 
(the ‘sequential’ execution time) and its height h(t) (the ‘parallel’ execution 
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time). The ratio \t\/h(t) captures in some sense the amount of parallelism 
(the speedup in [|||). Let M be a trace monoid. Define the generating series 


ieM 




te M 


H = 


ieM 


h(t) 


It is well known that L is a rational series [||]. We prove that F and H are 
also rational and we provide finite representations for the series. Exploiting 
the symmetries of the trace monoid enables to obtain representations of 
reduced dimensions. We use the rationality to obtain precise information 
on the asymptotics of the number of traces of a given height or length. 

Then, given a trace monoid and a measure on the traces, we study the 
average parallelism in the trace monoid. One notion of average parallelism 
is obtained by considering the measure over traces induced by the uniform 
distribution over words of the same length in the free monoid. In other 
terms, the probability of a trace is proportional to the number of its re¬ 
presentatives in the free monoid. This quantity was introduced in [27] and 
later studied in ||. || j|, 14, | 


. Here we define alternative notions of average 
parallelism by considering successively the uniform distribution over traces 
of the same length, the uniform distribution over traces of the same height, 
and the uniform distribution over Cartier-Foata normal forms. We prove in 
particular that there exists Am and 7 m in Ml such that 


EteM,|t|=ra MO 


£ 


t£M.,h(t)=n 


\t\ 


n ■ #{t € 


= n} 


n ■ ff{t € M, h(t) = n} 


Furthermore, the numbers Am and 7 m are algebraic. Explicit formulas in¬ 
volving the series L and H are given for Am and 7 m- 


The present paper is an extended version with proofs of 124]. 


2 The Trace Monoid 

We start by introducing all the necessary notions from the theory of trace 
monoids. The reader may refer to ||TT| , |l2| for further information. 

In the sequel, a graph is a couple (N, A) where IV is a finite non-empty 
set and Ac N x N. Hence we consider directed graphs, allowing for self¬ 
loops but not multi-arcs. Such a graph is non-directed if A is symmetric. 
We use without recalling it the basic terminology of graph theory. Given a 
graph and two nodes u and v, we write u —> v if there is a path from u to v. 

Fix a finite alphabet E. Let D be a reflexive and symmetric relation on 
E, called the dependence relation, and let I be its complement in E x E, 
known as the independence or commutation relation. 

The trace monoid , or free partially commutative monoid , M = M(E, D) 
is defined as the quotient of the free monoid E* by the least congruence 
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containing the relations ab ~ ba for every (a, b ) E I. The elements of M 
are called traces. Two words are representatives of the same trace if they 
can be obtained one from the other by repeatedly commuting independent 
adjacent letters. 

The length of the trace t is the length of any of its representatives and 
is denoted by |f|. Note that we also use the notation |Sj = #5 for the 
cardinal of a set S. The set of letters appearing in (any representative of) 
the trace t is denoted by alph(f). The graphs (X,D) and (X,I) are called 
respectively the dependence and the independence graph of M. Let finally ip 
denote the canonical projection from X* into the trace monoid M. In the 
sequel, we most often simplify the notations by denoting a trace by any of 
its representatives, that is by identifying w and ij)(w). 

Example 2.1. Let X = {{1,2}, {1,3}, {1, 4}, {2, 3}, {2, 4}, {3, 4}} (the set 
of subsets of cardinal two of {1,2, 3,4}). Define the independence relation 
I = {(u,v) : u fl v = 0}. The dependence graph (X, D) is the line graph of 
the complete graph K 4 , also called the triangular graph T 4 . For notational 



Figure 1: The dependence graph T 4 (left) and its independence graph (right). 

simplicity, set The dependence graph is represented on the left 

of Figure [l] and the independence graph on the right. In the trace monoid 
M(X,D), we have r = = a34ai2023ai4a23. 

A clique is a non-empty trace whose letters are mutually independent. 
Cliques are in one-to-one correspondence with the complete subgraphs (also 
called cliques in a graph theoretical context) of (X, I). We denote the set of 
cliques of M by £. 

An element (u,v) E £ x £ is called Cartier-Foata (CF-) admissible if 
for every b E alph(u), there exists a E alph(rt) such that (a, b) E D. The 
Cartier-Foata (CF) decomposition of a trace t is the uniquely defined (see 
(§, Chap. I]) sequence of cliques (ci, C 2 ,..., c m ) such that t = C 1 C 2 • • • c m , 
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and the couple (cj, Cj + 1 ) is CF-admissible for all j in {1 ,m — 1}. The 
positive integer m is called the height of t and is denoted by h(t). In the 
visualization of traces using heaps of pieces, introduced by Viennot in 32], 
the height corresponds precisely to the height of the heap. 


Example 2.2. Consider the trace monoid defined in Example 2.1. The set 
of cliques is £ = {a, a E S}u{ai2 034, 013024, 014023}. The CF decomposition 



Figure 2: Heap of pieces. 

of r is ( 012034 , 014023 , 023 ). We have |r| = 5 and h(r) = 3. We represented 
the heap of pieces associated with r on Figure ||. 


3 The Graph of Cliques 


We define the graph of cliques T as the directed graph with <£ as its set of 
nodes and the set of all CF-admissible couples as its set of arcs. Note that 
T contains as a subgraph the dependence graph (S,D). The graph T is in 
general complicated and looks like a maze. 


Example 3.1. We continue with the model of Examples 2.1 and 2.2. For 



Figure 3: The complement of the graph of cliques of X 4 . 
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simplicity, the graph represented Figure is the complement of the corre¬ 
sponding graph of cliques (the complement of the graph (N, A) is the graph 
(IV, (N x N) — A)). 


Lemma 3.2. If the dependence graph is connected, then the corresponding 
graph of cliques is strongly connected. 

Proof. Let (T*,D) be the dependence graph, £ the set of cliques, and T the 
graph of cliques. Given u, v E £, we want to prove that there is a path from 
u to v in T. We argue by induction on the value of |m| + |u|. If |u| + |u| = 2, 
the result follows by the connectivity of the dependence graph (£, D ). 

Now consider the case |k| + |w| > 2. Assume first that |u| > 1. Let a 
belong to alph(rt). Clearly ( u,a ) is CF-admissible. By induction, we have 
a v and we deduce that u —» v. 

Assume now that \u\ = 1. Then we have |u| > 1 and let v = v'ab,a,b E 
£. By induction, we have u —► v'a. Let us prove that v'a —> v'ab. By 
connectivity, there exists in (E,D) a path (co = o, • • • , = b). For j E 
{0, ...,*}, set Vj = v'acj if v'acj E £ and otherwise set Vj = wcj where 
w is the longest trace such that alph(rw) C alph(t/a) and wcj E C. By 
construction, we obtain that (no = v'a ,..., vy. = v'ab) is a path in T. It 
completes the proof. □ 


The above lemma can be restated as follows: given two cliques u and 
v there exists at least one trace in M such that the first factor in its CF- 
decomposition is u and the last one is v. 


We now use a standard reduction technique for multi-graphs (see j 1C, 
Chap. 4] or |17], Chap. 5]). We partition the nodes of T based on their set of 
direct successors. An equitable partition of Cl is a partition n = {£i,..., £ s } 
with the property that for all i and j the number a t j of direct successors 
that a node in Cl, has in is independent of the choice of the node in 
The s x s matrix A n = (is called the coloration matrix corresponding 
to 7r. In the case of the partition {{c},c E £}, the coloration matrix is the 
adjacency matrix of T. 


Example 3.3. We keep studying the model of Examples [2 .1 1 . 2.2 and 3.1 . 
Consider the partition 7r of £ defined by 


£l — {ai2,013, Am}, £2 — {^23) «24, «34}, ^3 — {ai2«34, ^13024, ^14023} ■ 


It is easily checked that the partition is equitable. The corresponding col¬ 
oration matrix is 


A n — 


2 2 \ 
3 2 
3 3/ 


3 

2 

3 
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A natural family of equitable partitions is the one induced by the non¬ 
trivial subgroups of the full automorphism group of T. Given such a group 
G , the cells of the corresponding partition ttq are the orbits into which £ is 
partitioned by G. The corresponding coloration matrix is denoted by Aq- 

An automorphism of (£, D) induces an automorphism of T. Indeed, 
consider an automorphism <f> of (X, D ). The map cj> : X —> E can be extended 
into a map <//:£—>£ as follows. Given c = u\- ■ ■ Uk £ £ with \ui\ = 1 for 
all i , set 4>'(c) = cj)[u\) ■ ■ ■ (j>{uk)■ Note that the definition is unambiguous 
since the letters commute. It is immediate that (ft is an automorphism 
of T. 

Due to the complex structure of T, finding its automorphisms is in gen¬ 
eral difficult. Finding the automorphisms of (E,D) is often an easier task. 
This simple observation allows us to focus on the automorphism groups of 
(E,D) and to consider their action on the nodes of T. When (E,D) has 
a great amount of symmetries, the corresponding reduction can be very 
important (see section [h^). 

Below we need to consider equitable partitions such that all the cliques 
in the same cell have a common length. This requirement is always satisfied 
for the equitable partitions associated with automorphism groups. 


Example 3.4. The model is the one of Examples \2.1[ [2.2| , [3.1| , and T3. The 
symmetric group ©4 of degree 4 is a non-trivial group of automorphisms of 
(E,D). It is of index 2 in the full automorphism group G of (E,D). The 
partition of £ induced by ©4 (or by G) is £1 = {a, a £ E} and £2 = 
{ 012034 , fli 3 « 24 , «i 4 fl 23 }- The coloration matrix is given by 


^-e 4 — 



4 Height and Length Generating Function 

Let F £ N[[x, y]] be the height and length generating function defined by 
F(x,y ) = = ^2 fk,i x k y l , 

teM k,i eN 

where x and y are commuting indeterminate and f^i is the number of traces 
of height k and length l. Set H{x ) = F(x, 1) and L(y) = F( 1, y). Then H(x) 
and L(y ) are respectively the generating functions of the height and of the 
length. The Mobius polynomial n(E,I) of the graph (£,/) is defined by 

Ai (S,/) = l + ^(-l)M y M. (i) 

It is well known [||, Chap. II] that L(y) is equal to the inverse of the 
Mobius polynomial, i.e. L(y) = ^(E,/) -1 . In particular, it is a rational 
series. 
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Proposition 4.1. Let M = M(E,D) be a trace monoid and let £ be the 
set of cliques of (E,I). Define the matrix A(x,y ) £ N[x,y] e ' x£ by setting 
A(x,y)ij = xy\ l \ if (i,j) is CF-admissible and 0 otherwise. Define also 
u = (1,..., 1) £ N[x,y] lxC and v{x,y ) = {xy^)i £ N[x,y] £xl . The height 
and length generating function is then given by 

F-l = J ^uA(x,y) n v(x,y ) = u (/ - A(x, y))~ l v(x, y) , (2) 

neN 

where 1 is the identity o/N[[x,y]] and I is the £ x £ identity matrix. 

Proposition [0] states that F(x, y) is a rational series of N[[x, y]\ and 
that (u,A(x,y),v(x,y)) is a finite representation of it. 


Corollary 4.2. The series L and H are rational, and we have L = 1 + 
u (I — A{1, y )) -1 u(l, y) and H = 1 + u (I — xA( 1,1 )) _1 xu(l, 1). 


Proposition 0 and Corollary [L^, although easy to prove, do not seem 
to appear in the literature. In the case of the length generating series, the 


rationality is not new but Corollary L2 provides a new formula for L. 

There exist related results in the context of directed animals. Indeed 
there is a bijection between directed animal of width k on a 2 d triangular 
lattice and traces in the monoid M(£, D) with E = {a\, ..., a *.} and D = 
{( ai,aj ), |i — j | < 1}. The precise asymptotics for such directed animals are 
derived in |2l], p5| with the same method as in the proof of Proposition |4. 


More generally, the method of proof of Proposition 4.1 can be viewed as an 
instance of the transfer matrix method p0| , Chap. 4.7]. 

In the context of trace monoids, the idea of working with the alphabet 
of cliques £ to study the height function appeared in || and was later used 
in [O. 


Let 7 r = {Ci,... ,£ s } be an equitable partition of £ such that all the 
cliques in £j have a common length Zj. Let A n = (aij)ij £ N sxs be the 
coloration matrix. Define the matrix A n (x,y) £ N[x,y] sxs by A w (x,y) = 
(■ aijxy li )ij. Define u n = (|£j|); £ N[x,y] lxs and v 7r (x,y) = ( xy li )i £ 
N[x,y] sxl . Then formula (|2|) holds when replacing u,A(x,y), and v(x,y), 
by u n , A 7T (x,y), and v w (x,y). The proof is similar to the one below. 


Proof of Proposition {j-A - As recalled above, with each trace is associated its 
unique CF decomposition. We associate with a path p in T the sequence of 
its nodes (ci,..., c^). By construction, the CF decomposition of the trace 
t = ci ■ ■ ■ Cfc is precisely (ci,..., c*,). In other words, the CF decompositions 
of traces are in one-to-one correspondence with the paths in T. The contri¬ 
bution of the trace t to the series F is x h ^y ^L The weight of the path p in 
the weighted automaton (u,A(x,y),v(x,y)) is 


k -1 


fc-i 


u, 


Cl 


A{x,y) CiCi+1 )v(x,y) Ck = (JJ xy lcil )xy M = x h(t) y 


1*1 


1=1 


2=1 
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This completes the proof of the result. 


□ 


It is easily checked that the series F(x, y ) is not recognizable in general. 
We recall that F = i fk,i xk y l is a recognizable series of N[[x,y]] if there 
exists K E N*,a E N lx ^,y(x) E N KxK ,p(y) E f^ KxR i and (3 E N Kxl , such 
that fk,i = ap(x) k p(y) 1 /3 for all k and l. 


Example 4.3. We persevere with the model of Examples | 2 . 1 
and 


3.1, 3.3, 


The height and length generating function is given by 


F 


3 )(-^ 

1 + xy 

1 — 5 xy — 3 xy 2 + 3 x 2 y 3 


—2 xy 
1 — 3 xy 2 


xy \ 
xy 2 ) 

(3) 


Setting x = 1, we check that the length generating function is the inverse of 
the Mobius polynomial, i.e. L = (1 — 6 y + 3y 2 ) -1 . Setting y = 1, we obtain 
the height generating function H = (1 + x)(l — 8 x + 3x 2 ) -1 . The Taylor 
expansion of the series F around 0 is 


F = 1 + 6 xy + 3 xy 2 + 30 x 2 y 2 + 30 x 2 y 3 + 150x 3 y 3 + 9 x 2 y 4 + 222 x 3 y 4 + 
750x 4 y 4 + 126x 3 y 5 + 1470x 4 y 5 + • • • + 71910x 6 y 8 + • • • . 


For instance, there are 126 traces of length 5 and height 3, or 71910 traces 
of length 8 and height 6 . 

We now use Proposition |4.1| to provide some precise results on the asymp¬ 
totics of the number of traces of a given length or height. 


Given a complex function analytic at the origin, a singularity is a point 
where the function ceases to be complex-differentiable. A dominant singu¬ 
larity is a singularity of minimal modulus. Throughout the paper, given a 
series S E N[[x]], we set S = Yl n (^\ n )x n ■ When applicable, we denote the 
modulus of the dominant singularities of S (viewed as a function) by p$. 
Classically, see |], 13, 33], the asymptotic growth rate of (S\n) is linked to 


the values of the dominant singularities. 


Lemma 4.4. We have pl = l or pn = l if and only */M(E,D) is the free 
commutative monoid over E. 


Proof. We have lirrisup n (L|n) 1//n = 1/pl, and lirnsup n (i7|n) 1//n = 1/ph 
(the ‘exponential growth formula’). It implies that pl < 1 and pu < 1. 

Assume there exists (a, b) E D with a b. Then all the traces t\ ■ ■ ■ t n 
with ti E {a, b} are of length n and height n. It implies that (L|n) > 2 n and 
that (H\n) > 2 n . It implies in turn that pl < 1/2 and pn < 1/2. 

Assume now that M(S,_D) is the free commutative monoid 


computation or using the results from section 6 T, we get (L|n) 
and (H\n) ~ It implies that pr = 1 and pu = 1. 


By direct 
1 = 1-1 


~ n' 


□ 











Proposition 4.5. Let (£,D) be a connected dependence graph. Then L and 
H have a unique dominant singularity which is positive real and of order 1. 


It follows (see JT], 13, 3^|) that when (£, D) is connected, we have (L\n) ~ 
a L pf n and (H\n) ~ ot H p H n , with a L = pf l ■ [L(y)(p L - y)\\ y=PL and a H = 
Ph 1 • \H{x){ph ~x))\ x=Ph . 

The proof of Proposition L5 is based on the representation given in 
Proposition |4.1|. F or convenience reasons, the proof is included in the proof 
of Proposition |5.1| and given in Appendix. 


Proposition 4.6. Let (£, D) be a non-connected dependence graph. Let 
(£ s , D s ) s£ s he its partition into maximal connected subgraphs. Denote by 
L S ,H S , the corresponding length and height generating functions. Then one 
has: 

1) the series L has a unique dominant singularity equal to pl = min s p£ s , 
and whose order is ff{s,pL s = Pl},' 

2) the series H has a unique dominant singularity equal to pu — FIs Ph s ■ Its 
order is |£| ifM.(E,D) is the free commutative monoid, and 1 + #{s, |£ s | = 
1 } otherwise. 


Let Ll and kn denote the respective orders of pi in L and pn in 
H. It follows from the above Proposition (see [|], [l3|, |33|) that we have 
(L\n) ~ aLn kL ~ 1 pf n , and (H\n) ~ aH‘n kll ~ 1 pjf 1 with aL = (pf kL /(kL — 
l )^'[ L {y){pL-y) kL ]\ y=PL and a H = {p H kiI /{k H -l)\)-[H{x){p H -x) kH }\ x=PH . 

Proof. We have L = [{. L s = }{„ /^(S s , I s ) _1 where p(.) is defined in ([[]). It 
implies directly the result on pl- 

Consider now the height generating function. We prove the result by 
induction on |5|. Assume first that jfS = 2 and set S = {1,2}. We have 
H = Yli j{Hi\i)(H 2 \j)x ma ' x ^’I\ It implies that 


(H\n) = (H\\n) £(fr 2 |i) + (H 2 \n) i|i) - (I?i|n)(^ 2 |n) . (4) 

i=0 z=0 


Applying Proposition |1.5| . we obtain (H\\n) = a n p H ”, with lim n a n = a E 
M?j_) and (H 2 \n) = bnpfj ”, with lim n b n = b E 

Consider first the case pu 1 < 1 and pn 2 < 1- We have 


(iL!|n)^(iL 2 |i) 

i=0 


— «nP g ?(^ bjp H \ ) 

i=0 




n 

anbpHyj^C^ibn-i/fypHz ) 
i=0 

ab{ 1 - PH 2 )~ 1 (PH 1 PH 2 )~ n - 
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The same type of identity also holds for the second term in (Q). Going back 
to (Q), we then obtain 

(H\n) ~ ab(( 1 - p H J ” 1 + (1 - Ptf 2 ) _1 - 1 )(ph 1 Ph 2 )~ ti ■ 


Hence we have pu = PHxPh 2 and the order of pu in H is 1. 


We consider now the case pu x = 1 and pu 2 = L By Lemma |4.4| , we get 
that M(£, D ) is the free commutative monoid over two letters. Applying 
we get that (H\n) = (2n + 1). Hence we have pu = 1 and the order of 
pu in H is 2. 

By symmetry, the last case to consider is pu x < 1 and pu 2 = 1- By 
Proposition L5, we have (Hi\n) ~ a Pu[- We also have {H. 2 \ri) = 1. Sim¬ 
plifying ([|), we obtain that (H\n) ~ anp’jJ 1 . It implies that pu = Pu x and 
that the order of pu in H is 2. 

Consider now the case #S > 2. Let (Si, D\) and (S 2 , D 2 ) be a partition 
of (S ,D) in two subgraphs such that (Si,Di) is connected. The induction 
hypothesis applies to (S 2 ,L ) 2 ) and the proof follows exactly the same steps 
as above □ 


The results on L in Proposition 4.5 and Proposition |4.6| can be restated 
as results on the smallest root of the Mobius polynomial of a non-directed 
graph. They improve on a recent result by Goldwurm and Santini (li]] 
stating that the Mobius polynomial has a unique and positive real root 
of smallest modulus. Our proof of Proposition 5.1 follows several of the 
steps of |Ii|. One central difference is that we work with Cartier-Foata 
representatives instead of minimal lexicographic representatives. Proving 
the strengthened statements while working with the latter does not appear 
to be easy. 


A matching in a (non-directed) graph is a subset of arcs with no common 
nodes. The matching polynomial of a graph is equal to Ylk(—^) km ky k i where 
rrifc is the number of matchings of k arcs. Hence, the matching polynomial 
of a graph G is equal to the Mobius polynomial of the complement of the 
line graph of G. Matching polynomials have been studied quite extensively. 
It is known for instance that all the roots of a matching polynomial are real 
. It implies that the same is true for the Mobius polynomial of a graph 
which is the complement of a line graph. For a general graph, the result is 
not true and one has to settle for the weaker results in Proposition |4.5| and 
Proposition [ 0 | Consider for instance the graph with nodes {a, 6 , c, d} and 
arcs {(a, b ), (6, a), (a, c), (c, a), ( b , c), (c, b)}. It is the smallest graph which is 
not the complement of a line graph. Its Mobius polynomial is p = 1 — 4y + 
3 y 2 — y 3 , which has two non-real roots. 
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5 Asymptotic Average Height 


We want to address questions such as: what is the amount of ‘parallelism’ in 
a trace monoid? Given several dependence graphs over the same alphabet, 
which one is the ‘most parallel’? To give a precise meaning to these ques¬ 
tions, we define the following performance measures. Let M n denote the set 
of traces of length n of the trace monoid M. We equip M n with a probability 
distribution P n and we compute the corresponding average height 

E n [h] = ^ Pn{t}h(t) . 

te m„ 

Assuming the limit exists, we call lim n E n [h\/n the (asymptotic) average 
height. Obviously this quantity belongs to [C -1 ,1], where C is the maximal 
length of a clique. Clearly the relevance of the average height as a measure 
of the parallelism in the trace monoid depends on the relevance of the chosen 
family of probability measures. This may vary depending on the application 
context. A very common choice is to consider uniform probabilities. It is 
the natural solution in the absence of precise information on the structure 
of the traces to be dealt with. Let us consider different instances of uniform 
probabilities over traces. 


5.1 Uniform probability on words 

Let n n be the uniform probability distribution over TP which is defined by 
setting fi n {u} = l/|£| n , for every u £ T n . We set P n = fj, n o ip -1 , i.e. 
P n {t} = pL n {w : ip(w) = £}■ The limit below exists: 


A. = A„(£ ,D) = lim Aid = lim WW) 


n 


n £ 


(5) 


This is proved using Markovian arguments in [ 
also be proved using sub-additive arguments. 


27]]. The existence of A* can 
More precisely, it is shown 
in [14] that hpip(.)) is recognized by an automaton with multiplicities over 
the (max, +) semiring, which provides a different proof of the existence of 
A*. In fact a stronger result holds. Consider a probability space (G,T, P). 
Let (x n ) n£ N* be a sequence of independent random variables valued in £ 
and uniformly distributed: P{x n = u] = l/|£|,it £ £. The probability 
distribution of (xi ■ ■ ■ x n ) is then the uniform distribution over It is 

proved in @0 that 


P{ lim 


h(tp(x i • • -x n )) 


n 


— A* } — 1. 


( 6 ) 


Except for small trace monoids, A* is neither rational, nor algebraic. The 
problem of approximating A* is NP-hard Q. Non-elementary bounds are 


11 






proposed in ||. Exact computations for simple trace monoids are proposed 
in H 28|. A software package named Ers [^2|] enables to simulate and 
compute bounds for A*. 


5.2 Uniform probability on traces 

A natural counterpart of the above case consists in considering the uniform 
probability distribution over M n , i.e. Q n {t} = l/|M n | for every t £ M n . 
Assuming existence, we define the limit 


= Am(£, D) = lim 


E n [h] 


= lim ■ 


n 


n|M n | 


= lim 


Z)m.gN m fm,n 
SmgN n fm,n 


(7) 


Dually, let m M be the set of traces of height m, and let Q m be the 
uniform probability measure on m M, i.e. Q m {t} = l/| m M| for every t £ m M. 
The average length of a trace in m M is equal to E m [l\ = Qm{t}\t\. 

Assuming existence, we define the limit 


7m = 7m(£, D) = lim 

m m 


= lim 

m 


Etg m M 1^1 

m| m M| 


ZlngN n fm,ri 
m X)ngN m fm,n 


( 8 ) 


The quantity 7 m is an (asymptotic) average length. The analog of A* and 
Am is then the quantity q^ 1 . 

Proposition 5.1. The limits Am in and 7 m in exist. Furthermore, 
Am and 7 m are algebraic numbers. 


The proof, based on Proposition 4.1, is rather long and we postponed it 
to the Appendix. In fact, the proof of Proposition 5.1 provides a formula 
for Am and tm- Define G = (dF/dx)(l, y) and G = (dF/dy)(x, 1). Then, 
with the notations of section 0, we have 


[G(y)(pL - y) kL+l }\y= PL = [G(x)(p H ~ x) kH+1 ]\ X=PH 

k L p L [L(y)(p L - y) k r]\ y=pL ’ 7M k H pH[H(x)(p H - x) k H] ]x=pn ' 

(9) 


5.3 Uniform probability on CF decompositions 

In this section, we use some basic results on Markov chains, for details see 
for instance J|, ^6], ^9|. Let A £ {0,1 } £xC be the adjacency matrix of T. We 
associate with A = (, the Markovian matrix 

A — (a>ij)ij. aij — Qjj ( ^ ^ ajk ) • (10) 

k 
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We define the vector 1 E M lxC by 1 \ = 1/|(£| for all 7 We define the 
probability measure R m on m M as follows: for a trace t E m M with Cartier- 
Foata decomposition (ci,..., c m ), we set R m {t} = l ci a cic2 ■ ■ ■ a Cm _ 1Cm . 

An interpretation for the family (R m ) m is as follows. Consider a Markov 
chain (X n ) n on the state space £ with transition matrix A and with initial 
distribution 1. Then R m {t} = P{X\ ■ ■ ■ X m = t}. Equivalently, given a 
trace t of height m, we get a trace t' of height m + 1 by picking at random 
and uniformly an admissible clique c and by setting t' = tc. This can be 
loosely described as a ‘uniform probability on CF decompositions’. 

The average length of a trace in m M is equal to E m [l\ = Ylte m M Pm{t}\t\- 
Assuming existence, the analog of A*, Am or y^ 1 is then the (asymptotic) 
average height 

TYl 

A cf = A rf (E, D) = lim . (11) 

m h m [L\ 

Let p = (p(c )) c be defined by p = lim n 1(7 + A + • • ■ + A n ~ l )/n. It can be 
interpreted as the limit distribution of the Markov chain (X n ) n . According 
to the ergodic theorem for Markov chains (Theorem 4.6 in |29|]), the limit 
exists in (0 and we have 

Ad = (EpMM )- 1 ■ ( 12 ) 

esc 

When (T,,D) is connected, it follows from Lemma |3.2j that A is irre¬ 
ducible. Then p is entirely determined by pA = p and Pi = 1 (Perron- 
Frobenius Theorem, see |2l| ). It implies that A c f is explicitly computable and 
rational. When (S,D) is non-connected, A c f is still explicitly computable 
and rational according to Proposition |5.4| . 

Consider an equitable partition it = {(£i,..., £ s } such that all the cliques 
in have a common length l t . There exists an analog of (|l^) corresponding 
to this partition. Let A n be the Markovian matrix associated with the 
coloration matrix A n . Let p n be defined by p n = lim n 1(7 + A n + ■ ■ ■ + 
A r )~ 1 )/n. Then, we have A cf = P-k^UT 1 ■ 

5.4 Non-connected dependence graphs 

Assume that (E,7A) is non-connected and let (E s , D s ) s£ s be the maximal 
connected subgraphs of (T,,D). We now propose formulas to express the 
average height of (S,D) as a function of the ones of (E S ,D S ). 

First, it is simple to prove using (| 6 |) and the Strong Law of Large Num¬ 
bers (see also Theorem 5.7 in |27j) that we have 

A*(E,77) = max ( ■^■A*(E S ,D S ) ) . (13) 

s£S 2j 
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Proposition 5.2. Denote by L s the length generating function of (X s , D s ). 
Define J = {j 6 S, pLj = minggs pl s }- Then, we have 

Am(£, D) = Am(£j, Dj) , (14) 


where Xj = UjgjXj, and Dj = Ujg jDj. 

The proof uses Proposition |5.1| and is given in Appendix. There seems 
to be no simple way to write Am(Xj, Dj) as a function of Am(Xj, Dj),j E J, 
as illustrated by the example of section | 6 . 1 |. 


Proposition 5.3. Define J = {j E S, |Xj 


7m(X,7A) = u(T,j,Dj) + 


> 1}. Then, we have 

\s-J\ 


(15) 


if J 7 ^ 0. If J = 0, that is ifM.(T,,D) is the free commutative monoid, we 
have tm(£, D) = (|X| + l)/2. 


The proof is given in Appendix. Proposition 5.3 is the counterpart of 
Proposition for tm, but it is more precise. 

Proposition 5.4. Let A be defined as in section 5.8. Let (7 S be the set of 
cliques of(T, s ,I s ). Define the matrix B of dimension (7x(7 as follows: Bij = 
A-ij if i £ Uses and Bij = 0 otherwise. Define the vectors E S, of 

dimension (7 as follows: (J^)j = 1 if i € <t s and (?£ s )i = 0 otherwise. Set 
q s = 1(7 — B)- l Tr s , where 1 = (1/|£|,. . ., 1/|£|). Then we have 

A c f(X, D)~ l = ^ q s A c f (S s , 7?s ) _1 ■ (16) 

seS 

Proof. The graph of cliques T of (S,7) can be decomposed in its maximal 
strongly connected subgraphs (mscs). Replacing each mscs by one node, we 
define the condensed graph of T. The final mscs are the mscs without any 
successor in the condensed graph. According to Lemma L2, the final mscs 
are precisely the ones with sets of nodes (7s where s € S. 

Remark that the non-negative matrix B is such that B^ < 1 for 
every i E (7. In particular, it implies that (7 — B) is invertible. Define 
q s = 1(7 — T?) -1 !^ for every s E S. 

The quantities q s can be interpreted in terms of the Markov chain (X n ) n 
defined in section 5.3: we have q s = lim n P{X n E (7 S } (Theorem 4.4 in p9[| ). 
Let A s be the restriction of A to the index set (<7 S x 61s) an d let A s be 
the Markovian matrix associated with A s . Let p s be the unique probability 
distribution on (7 S such that p s A s = p s (Perron-Frobenius Theorem, Chapter 
1 in [p9f) . According to the ergodic theorem for Markov chains (Theorem 
4.2 in j29|l), we have A cf (S s ,7?s) = (E c gc s Ps(c)|c|) _1 . 

Define the vector p s of dimension (7 by p s (c) = p s (c) if c E <7 S an d 
p s (c ) = 0 otherwise. The vector p = YIsgsQsPs then the unique limit 
distribution of (X n ) n . By the ergodic theorem for Markov chains (Theorem 
4.6 in |[29|1), we obtain (|l6|). □ 
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5.5 Comparison between the different average heights 

In terms of computability, the simplest quantity is A c f and the most compli¬ 
cated one is A*. This is reflected by the fact that A c f is rational, that Am and 
are algebraic, and that A* is in general not algebraic, see for instance 

&■ 

Another point of view is to compare the families of probability measures 
(P n )n, ( Qn)n, ( Qn)n , and {R n ) n associated respectively with A*, AmjTmS and 
A c f. A family of probability measures ( n n ) n defined on (M n ) n (or („M) n ) is 
said to be consistent if we have n m {t} = n n {v : 3u, v = tu } for all m < n. 
In this case, there exists a unique probability measure on infinite traces 
whose finite-dimensional marginals are the probabilities (n n ) n - Consistency 
is a natural and desirable property. Clearly the families ( P n )n and ( R n )n 
are consistent. On the other hand, the families (Q n )n and (Q n ) n are not. 

It is also interesting to look at the asymptotics in n of the empirical 
distribution of {h(t)/\t\,t € M n } or {\t\/h(t),t € n M}. For a G M, let 5 a 
denote the probability measure concentrated in a. It follows from (p) that 
we have 


p n{t}S h (t)/\t\ —> < 5 a , , 

t 

with the arrow standing for ‘convergence in distribution’. Similarly, it fol¬ 
lows from the ergo die theorem for Markov chains that we have 

'Y^Rn{t}8 h {t)/\t\ —> , 

t seS 

the notations being the ones of section There are no such concentration 
results for ( Q n ) n and ( Q n )n • To check this, consider the case of the free 
commutative monoid over two letters. We obtain easily that 

^ ^ Qn{t}^h(t)/\t\ * Pi ^ y Qn{t}^\t\/h(t) * F" , 

t t. 

where U is the uniform distribution over the interval [1/2,1] and V is the 
uniform distribution over the interval [1,2]. 

Consider two dependence graphs (T*,D i) and (S,Z? 2 ) with D\ C D 2 . 
The intuition is that M(S,iAi) should be ‘more parallel’ than M(£,Z) 2 ). In 
accordance with this intuition, it is elementary to prove that A*(S,T>i) < 
A*(S,T> 2 )- However, the corresponding inequalities do not hold for Am and 
A c f. Consider for instance the trace monoids over three or four letters whose 
average heights are given in section |B]. This raises some interesting issues 
on how to interpret these quantities. On the other hand, we conjecture that 
the inequality tm(S,T)i) _1 < 7 m(£,IA2) -1 is satisfied. 
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6 Some Examples 


6.1 The free commutative monoid 


Consider the dependence graph (E,D) with D = {(u,u),u £ E}. The 
corresponding trace monoid M(E,D) is the free commutative monoid over 
the alphabet E, which is isomorphic to N s . Set now k = |E|. 

A direct application of (|l|) yields A* = l/k. Consider now A c f. The 
final maximal strongly connected subgraphs of T are precisely the cliques of 
length 1. In particular, they are of cardinality 1. Applying the results in 


section |5.3| , we get A c f = 1. 

Let us compute Am and 7 m- Using the methodology of sections [| and || 
is feasible, but there are simpler methods. Consider 7 m first. By a counting 
argument, we get 


— (1 + V + 

tSm M 


+ y m ) k -(l + y + --- + y m ~ i ) 


m—l\k 


(17) 


We obtain | m M| = L m (l) = (m + l) k - m k and Ete m M 1*1 = L 'm(X) = 
k(m + l) k m/2 — km k (m — l)/2. We deduce that = 2 /{k + 1). 

Let us now compute the average height Am- The length generating func¬ 
tion is L = (1 — y)~ k ■ Applying a result from Carlitz jjj], we have 

■+n k 


G = (dF/dx)(l,y) = 


Y, max(m,... ,n k )y ni 

(m,...,n fc ) 

1 

E 


(1 - y) k + l 


Using (|) , we obtain 



i —1 




1 + y h- Vy 


i— 1 


! + - + 




(18) 


(19) 


The last equality is a classical identity for harmonic summations (see Chap¬ 
ter 6.4 in (2(|). Asymptotically in k, we have Am ~ log {k)/k. This is to be 
compared with A* = l/k and ~ 2/k. 

Consider now the trace monoid M(E, D) obtained as the direct product 
of the free monoids with |Ei| = ••• = |E*,| = c and c > 1. 

Equivalently, the dependence graph is (E, D) with E = U^ =1 Ej, D = u£_ 1 Di, 
and Di = Ej x Ej for all i. Clearly, we still have A* = l/k and A c f = 1. The 
formulas in ( 0 ) and (|l8|) still hold when replacing y by cy. We deduce that 
Am is still given by (|l9|). On the other hand, we have = l/k, a value 
which can also be obtained using Proposition Hence the value of 
does not depend on the value of c, c > 1, and is different from the value 
obtained for c = 1. 
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6.2 The ladder graph 

In view of Proposition [4.1 1 . the simplest sets of cliques are those with the 
property that the clique partition according to the length is equitable, so 
that the dimension of the corresponding coloration matrix reduces to the 
maximal size of a clique. This holds if the full automorphism group of 
(£, D ) or r acts transitively on the sets of cliques of the same length. This 
is in particular the case when the dependence graph is the triangular graph, 
i.e. the line graph of the complete graph K n , or the square lattice graph, i.e. 
the line graph of the complete bipartite graph K n n . 

A particularly simple class of independence graphs is the class of node 
and arc-transitive triangle-free graphs. In this case, the coloration matrix 
associated with the full automorphism group is of dimension 2x2. Let us 
consider a family of graphs of this type. 

Let (£,/) be the ladder graph, i.e. £ = {1, ...,2n} and I = {( i,j ) G 
£ x £ : i + j = 2n + 1}. The corresponding dependence graph is known as 
the cocktail party graph CP n . The full automorphism group is the wreath 
product W = @ n [Z/2Z] of the symmetric group of degree n with Z/2Z. 
The corresponding partition of £ is {Ci, £2} with fi = {c 6 £, |c| = 1} and 
£2 = {c E £, |c| = 2}. The coloration matrix is 

( 2n — 1 n — 1 \ 

Aw = { 2n n ) ■ 

The computation of A*(CP n ) was worked out by Brilman (see Proposi¬ 
tion 14): 


A*(CP n ) = 1 I 1 + 


\Jn - 1 

VnT T 

We compute F using the reduced representation induced by the partition. 
The dominant singularity of L is (1 — V1 — n -1 ) and we obtain 


AM(CP n ) — — ( 1 + 


n 


2 sjn — \Jn — 1 / 

The dominant singularity of H is (3 n — 1 — \/9n 2 — lOn + 1) /2n, and we get 

A(5n - 1 - A) 


7M(CP n ) 1 = 


-, A = v 9n 2 — lOn + 1 . 


2A(4n - 1) - 2(8n 2 - 9n + 1)' 

Considering the Markovian matrix A\y and using (|l2|), we obtain 

9n — 7 


A cf (CP n ) = 


12n - 10 
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We check that lim n A*(CP n ) = lim n AM(CP n ) = 1 and that lim ra 7 m(CP„) 1 
= lim n A cf (CP n ) = 3/4. 


Remark that CP 3 = T 4 . By specializing the above results to n = 3, we 
get the average heights for the triangular graph T 4 considered in Examples 

We have 


2.1,E 


3.4, and 4.3 


A*(T 4 ) = ^ 
Am(T 4 ) = = 


= 0.854-, A cf (T 4 ) 
0.922 • • • , tm(T k)^ 1 


{§ = 0.769-•• , 

= 0.735- ■■ 


A Proofs of the results in section [572 


This section is devoted to the proof of Propositions 5.1, 5.2, and 5.3. 


Proof of Proposition 5.1. 


We give the proof for Am- The one for 7 m is similar (and easier!). Recall 
that L = F(l,y) = i s the length generating function. Define 


G 


£(i-») = £*<A/'- 


Assuming existence of the limit in (]?]), we have Am = lim ra (G[n)/(n(L|n)). 

According to Pringsheim’s Theorem pi] , Sec. 7.21], L and G have a 
positive real dominant singularity. They are denoted respectively by pl 
and pc according to the previous conventions. Since we have n(L\n)/C < 
(G\n) < n(L\n), it implies that pl = pc■ Let be the order of pl in L. 


Assume that pl is the unique dominant singularity in L. Assume that 
the order of pl in G is {ki, + 1) and is strictly larger than the one of the other 
singularities of modulus p l (there might exist several dominant singularities 
for G, see section 5.1). Then the limit in (|7|) exists and we have, 


[G(y)(pL ~ y) kL+ \= PL 

k LPL ■ [L(y)(p L - y) k L } ly=pL 


In particular, Am is an algebraic number. The above assumptions on the 
dominant singularities of L and G ensure that the sequences ((L\n)) n and 
((G|n)) n do not have an oscillating behavior. It remains to prove that these 
assumptions actually hold. 

We work with the representation (u, A(x,y),v(x,y)) of F given in the 
statement of Proposition ^T]. We have 

uAdj (I - A(x,y))v(x,y) 
det (I-A(x,y)) 
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where det(.) stands for the determinant and Adj(.) for the adjoint of a 
matrix. Set Q(x,y ) = det(/ — A(x,y)) and F = P(x,y)/Q(x,y). It follows 
that we have L = P(l,y)/Q(l,y). By differentiating F. we get 

G= «b(S< i ’»)- £ -S< i ’»))- (21) 

Set Q(y) = <5(1, y)- The above equations imply that the set of singularities 
of L (resp. G) is included in the set of singularities of 1/Q(y). In particular, 
a dominant singularity of L (resp. G) has a greater modulus than a dominant 
singularity of 1/Q(y). 

The next step consists in transforming the triple (u, A{ 1 , y),v( 1, y)) into 
another triple (■ u,yA,yv ) of dimension K > |(£|, where u 6 N lx ^,A 6 
N KxK ,v € N^ xl , and where we set yA = (yA l j) l] and yv = (■ yvi)i■ 

Before formally defining it, we illustrate the construction on the figure 
below. As usual we view a triple as an automaton with multiplicities, i.e. 
as a weighted graph with input and output arcs. We have represented the 
portion of the automata (u. A{ 1, y),v( 1, y)) and (u, yA , yv) corresponding to 
the cliques u and v where |w| = 3, |u| = 2 , and (it, v) is CF-admissible. 



( u , A(l.y), v(l,y)) 


(u, yA, yv) 


Consider the index set 

E = {(c, 1),... ,(c, |c|),c e £} . (22) 
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Let us define u £ N lxE , A £ N ExE , and v £ N Exl as follows: 

f 1 if i = (c, 1), c £ £ 
u t = < 

I 0 otherwise , 

1 if i = (c, k),j = (c, k + 1), c £ £, 1 < k < |c| 

1 if i = (c, |c|), j = (c, l),c £ £ 

1 if * = (c, |c|),j = (d, l),A cd / 0 

0 otherwise, 

1 if i = (c, |c|), c £ £ 

0 otherwise. 



In an automaton, an input (resp. output) node is a node with an in¬ 
put (resp. output) arc. A successful path is a path from an input node 
to an output node. There is a one to one mapping between successful 
paths in the automata (u, A(l,y),v(l,y)) and ( u,yA,yv): to the success¬ 
ful path (ci,...,Cfc) in (u, A(l,y),v(l,y)) corresponds the successful path 
((ci, 1 ),... , (ci, |ci|), (c 2 , 1 ), • • •, (c*;, |c fc |)) in ( u,yA,yv ), and vice versa. Note 
that the lengths of corresponding paths do not coincide. Using this corre¬ 
spondence, we get that 

L = l + u(I - A( 1 , y)) _ 1 u(l, y) = l + u(I - yA)~ l yv T . (23) 

Let us prove that 

Q(y) = det(I - A( 1 , y)) = det(J - yA) . (24) 

Given a matrix M of dimension n, we have 

det(M) = E sgn(cr)M l£r ( 1) • • • M na(n) , 

ff£ &n 

where & n is the set of permutations of { 1 ,... ,n}, and where sgn(.) is the 
sign of a permutation. The permutations having a non zero contribution to 
the determinant are the ones which correspond to a partition into simple 
cycles of the nodes of the graph of M. 

We have seen above that there is a one-to-one correspondence between 
successful paths in the graphs of A(l,y) and yA. There is also clearly a 
one-to-one correspondence between simple cycles in (the graphs of) A[l,y) 
and yA. When comparing the simple cycles of (/ — A(l,y)) and (I — yA), 
one needs to be more careful. 

Let S be the set of simple cycles of (/ — A( 1, y)) and let S be the one of 
(I — yA). To the simple cycle c = (ci,... ,cjf) in S, there corresponds the 
simple cycle c = ((ci, 1),... , (ci, |ci|), (c 2 ,1), - - -, (c k , |c fc |)) in S. A simple 
enumeration shows that 

S = {c, c £ 5} U {((c, i)),c £ £, |c| >!,!<*< |c|} . 
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Given c E S (resp. S'), we denote by w(c) the contribution of c to det (I — 
A{ 1, y)) (resp. det (I — yA )). More precisely, for c = (c±,..., Ck ) and setting 
M = I — A(l,y) (resp. M = I — yA), we set 


w(c) = ^2 sgn(cr)M ( 

o-eSfc 




Consider c = (ci,...,Cjt) € S. We have 
rc(c) = 


1 — yl Cl l if A: = 1 

(—l) fc_1 nti -2/ |ci| = -2/ Ei |ci| if A: > 1 


Let c = ((ci, 1 ),... , (ci, |ci|),..., (cfc, |c fc |)) be the corresponding cycle of S. 
Then we have 


w(c) 


1 — y if k = 1 and |ci| = 1 

(—l)Ei 1*1-1 ]“[ ? fc =1 ngi —y = —^ otherwise 


We check that iy(c) = rc(c) except in the case k = l,|ci| > 1. In this 
last situation, we have w(c) = 1 — yl ci l and w(c) = — yl cl L However, this 
difference is precisely compensated by the contribution to det (I — yA) of 
the simple cycles in {((c, i)),c G (£, |c| > 1,1 < * < |c|}. We conclude that 
det(I — A{ 1, y)) = det(J — yA). 


Let (Yii,Di),i G U, be the maximal connected subgraphs of ( T,,D). Let 
£ be the set of cliques of (E, I) and let be the one of (Ej, Ij). 

For V C It, define Gy = m v ^y c v, c v G Gi;}- Note that we have G* = 
Grji. The set G is partitioned by the sets Gy,H C 'll. Let T be the graph 
of cliques of M(E,D). Using Lemma 3.2, we get that the maximal strongly 
connected subgraphs of T are the subgraphs with sets of nodes Gy, V Cll. 
Clearly, there is a path in T from a node in Gy to a node in Gy/ if and only 
if W C V. 


It implies the following. The restriction of the matrix A to the index set 
Gy, denoted by Ay, is irreducible. Now range the index set G according to 
the order Gfq,..., G[/ fc , where U\, ..., 14, is an ordered list of the subsets of 
'll satisfying the property: Ui C Uj =>- i > j. Then the matrix A is block 
upper-triangular with the blocks Ajj x , ... ,Ajj k , on the diagonal. An analog 
statement holds for A, replacing Gy by Gy = {(c, 1),... , (c, |c|), c G Gy}. 
We denote by Ay the restriction of A to the index set Gy. We have 


det (I - A(l,y)) 


det (I — yA) 


det (I - A v (l,y)) 

Vcu 

det (I - yA v ) . 

vcu 


(25) 
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Given an index set S and s £ S, define 3 S G N lxS by (J s ) s = 1 and 
(J s )< = 0,t / s. For i.j £ £. define 

L ij =3i(I-A(l,y))- 1 y^3j . 


The coefficient (L,^ \n) can be interpreted combinatorially as the number of 
paths from i to j with weight y n in the automaton (u,A(l,y),v(l,y)). In 
particular, we have L = 1 + JT j Ljj. With a proof similar to the one of 
(23), we get 


^ = %{I - A(l, y))- l y\ j ^ = %,i) (/ - yA) 


~V5,bl) 


(26) 


Consider u £ £. Let A{l,y)[ u ] denote the matrix obtained from A(l,y) 
by replacing the line and the column u by a line and a column of zeros. 
Then we have 


y\ u \kd](I - A{l,y)) uu = y ]ul det(J - A(l, y) [u] ) 
det(I — A(\, y)) det(I - A(l,y)) 


Let -Ar u i denote the matrix obtained from A by replacing the line (u, |rt|) 
and the column (it, 1) by a line and a column of zeros. With a proof similar 
to the one of (|4|), we get 

y |u| det(7 — A{\, y)[ u \) = ydet(I - yA [u] ) . (28) 

Assume that u belongs to £jj and let denote the restriction of Ar u i to 

the index set £u- Using (p7|), (p^), (p8|), and (p5|), we obtain 

y det (I - y A [u] ) ydet (I-yA Uj[u] ) 

J-Jnini - ' - ~ • ( U I 

det(I — yA) det (I — yAjj) 


We have Ajj^ u ] < Ajj (for the coordinate-wise ordering) and Afj^ / Ajj. We 
have seen above that Au is irreducible. According to the Perron-Frobenius 
Theorem for irreducible matrices (see for instance [p9|, Chapter 1.4), it 
implies that the spectral radius of Au is strictly less than the one of Au. 
Now, the roots of the polynomial det(I — yA U} [ u i), resp. det(I — yAu), are 
the inverses of the non-zero eigenvalues of Aj/m, resp. Au- Hence the 
possible simplifications between the numerator and the denominator in the 
right-hand side of (29) do not involve any dominant singularity. 

We conclude that the dominant singularities of L uu are precisely the 
dominant singularities of 1/det (I — yAu). 


We have (L\n) > (L uu \n) for all n. It implies that a dominant singularity 
of L has a smaller modulus than a dominant singularity of L uu . We deduce 
that a dominant singularity of L has a smaller modulus than a dominant 
singularity of l/det(I — yAu) for all U, hence a smaller modulus than a 
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dominant singularity of 1/Q(y). Using that (G\n) > (L|n), we obtain the 
same result for G. 


We conclude that the modulus of the dominant singularities of L, G, 
and 1 /Q(y) are equal. Furthermore, the sets of dominant singularities of 
L and G are included in the set of dominant singularities of 1/Q(y). Since 
Q(y) = det(I — yA), the set of dominant singularities of 1/Q(y) is also equal 
to the set of inverses of maximal eigenvalues of A. Let p(A) = p^ 1 denote 
the spectral radius of A. 


First assume that p(A) = pi = 1. According to Lemma |4.4j, M (X, D ) 
is the free commutative monoid over X. The analysis of section |6.1| applies. 
In particular, the limit Am in (iTj) exists and is given in (19). It is obviously 
algebraic and even rational. Hence Proposition 5.1 is satisfied in this case. 


From now on, we assume that p(A) > 1. Let us specialize for a moment 
to the case where (X,D) is connected. Using the above analysis, the matrix 
A is irreducible. For any a E X, we have > 0. We conclude that 

A is primitive. By Perron-Frobenius Theorem for primitive matrices ( |29|l , 
Chapter 1.1), the matrix A has a unique eigenvalue of maximal modulus 
which is positive real and of multiplicity 1. 


We conclude that pl is the unique dominant singularity of L and G. We 
conclude also that the order of pl is 1 in L, and at most 2 in G. Since 
nC' _1 (L|n) < (G\n) < n(L\n), we deduce that the order of pl in G is 2. 

We have just proved that the result of Proposition [L^ holds for L. The 
proof of Proposition |4.5| for H is similar (and easier). 

Let us come back to the general case for (X,D). Since we have now 
proved Proposition |4.5| , we are allowed to use Proposition |4.6| (the proof of 
the latter requires the former). We conclude that in all cases, L has a unique 
dominant singularity. 


It remains to study the set of dominant singularities of G. To do this, 
we study the set of eigenvalues of A of maximal modulus. 

Fix a subset V C U and consider the restricted matrix Ay- Let p{Ay) 
denote the spectral radius of Ay. We distinguish between two cases. 

Case (I). Assume there exists v E V such that M {Y, V ,D V ) is different 
from the free monoid X*, or equivalently such that I v is not empty. Then 
there exists c,d£ £y such that |c| = \d\ + 1. It implies that the cyclicity of 
the matrix Ay is 1. Since Ay is irreducible, we deduce that it is primitive. 
According to the Perron-Frobenius Theorem for primitive matrices (J2f|, 
Chapter 1.1), the matrix Ay has a unique eigenvalue of maximal modulus 
which is positive real and of multiplicity 1. 

Case (II). Assume now that M (T, V ,D V ) = X* for all v E V. It implies 
that |c| = \V\ for all c E £y. The cyclicity of Ay is |V| and Ay is not 
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primitive as soon as \V\ > 1. However in this case, we are able to completely 
compute the spectrum of Ay. Set K = n,cy 1^1- It is more convenient to 
work with Ay(l,y). Using the same arguments as in the proof of (p4|), we 
get 

det(/ - A v ( 1, y )) = det(/ - yA v ) . 

We also have Ay{ 1, y ) = y\ x ^Ay{l, 1) and Ay( 1,1) is the matrix of dimen¬ 
sion K x K whose entries are all equal to 1. The eigenvalues of Ay{ 1,1) are 
0 with multiplicity (K — 1) and K with multiplicity 1. We have 

det(J — y\ v \A v (l, 1)) = y K|y| det(y _|y| / - A v (l, 1)) 

= y K ^(y- lVl ) K -\y- lVl -K) = (l-Ky^). 

It follows that the non-zero eigenvalues of Ay are 

A' 1/|y l exp (^p), k = 0,..., \V\ - 1, (30) 

all with multiplicity 1. In particular, we have p(Ay) = K l ^ v L According 
to (|25|), the spectral radius of A is given by 

p(A ) = max / o(Ai/) . 

Fell 

Dehne V = \V C 'll : Vu G U, /„ = 0} and S = {V C 11 : p(Ay) = p{A)}. 
Using the above analysis, we can distinguish between two situations. 

First, assume that VnS = 0. According to Case (I), it implies that p(A) 
is the only eigenvalue of maximal modulus of the matrix A. We conclude 
that pc = Pl = p(A)- 1 is the only dominant singularity of G. 

Second, assume that there exists U G V n 8. According to Case (II), it 
implies that 

p{A) = p(Ajj) = ( n |E tt | ) 1 /l^l . (31) 

u&U 

Let y(S) denote the power set of a set S. We deduce easily from © that 
V n S = T(V), with V = {w G V : |£„| = max ||} . 

x£V 

We conclude that the set of maximal eigenvalues of A is precisely given by 
I J < (max |S X |) exp (2iTtl/L), l = 0,... ,L — 1 > . (32) 

L =1 L " eV J 

Set K = max xg v |^x|- The multiplicity of the eigenvalue K is at least \V\ 
(and it is exactly |U|if§ = T(U)). For a complex and non positive real max¬ 
imal eigenvalue, the multiplicity is exactly the number of appearances of the 
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eigenvalue in (|32|). The maximal such multiplicity is equal to |_| V"|/2j and 
attained for the eigenvalue —K. It follows that the maximal order of a com¬ 
plex and non positive real dominant singularity in 1/Q(y) is |_| V| /2J. Using 


(21), we conclude that the maximal order of a complex and non positive real 
dominant singularity in G is |_| V|/2j. 


Now we also have L = \\ 

(E v ,I v ),v e V, is (1 — Ky ). We deduce that 


Vv&V 


Wv^v Lv- 


The Mobius function of 


L = 


1 


(1 - Ky)\V I 


n Ly 

v&V 


The order of the singularity 1/K in L is consequently at least |U|. Since 
we have n(L\n)/C < ( G\n ) < n(L\n), we deduce that one of the dominant 
singularities of G must be of order + 1) > \V\ + 1. Since we have 
[|U|/2j < |V| + 1, the only possible choice is 1/it'. 

We conclude that the positive real dominant singularity of G has a 
strictly larger order than all the other dominant singularities. It completes 
the proof. □ 


Proof of Proposition 


5.1 


The notations are borrowed from the statement of Proposition |5.2| . To 
avoid trivialities, assume that J S. Let (Si, Hi) = U Dj), and 
(S 2 , D 2 ) = Uj 6 (5_j)(Sj, Dj). Let L , Li, and L 2 be the respective length 
generating functions of (S,H), (Si, Hi), and (S 2 ,H 2 ). By construction, 
we have pl 1 < Pl 2 - Let kr n and be the order of pl x and pl 2 in their 
respective series. 

According to Proposition |4.6|, we have (Li|n) = a n n fcl_1 p^/ with lim n a n 
= a E M+, and (L 2 \n) = b n n k2 ~ 1 pf with lim n b n = b E R?j_. Furthermore, 
we have L = L 1 L 2 , hence 

n 

(L|n) = ^o n _i(n-i) fcl-1 p2 1 (n_ ‘ ) &i* fc2- VlJ 

i =0 

= an kl ~ l pf™ (S^ian-i/ajbii* 2 - 1 ^ - i/n) kl ~ l (p Ll /p L2 ) % 

\i =0 

Since pl x IPl 2 < 1, the series B = Ylt^o ^ > P k2 ^ 1 {PL 1 /Pl 2 Y is convergent. 
Furthermore, we obtain easily that 

n 

lim V^(a n _j/a)6ji fe2_1 (1 - i/n) kl ~ l {p Ll /p L2 ) 1 = B . 

n zJ 

2—0 

We deduce that (L\n) ~ aBn kl ~ l pff\ 
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Let / be an increasing map from N to N such that lim n f(n) = +oo and 
lim n f(n)/n = 0. Define 

= {t G M(£, D), |t| =n,n- f(n) < |t| Sl < n} , 

where |f| Sl = E aeSl |*|a- We have 

/ f( n ) 

#L n = an fcl "Vzr 5^(on-i/o)6i** i2_1 ( 1 - i/n) kl ^ l {p Ll / Pl 2 ) 1 

\i =0 

Since lim n /(n) = +oo, we have ~ a5n fcl_1 p~[™ and lim„ #L n /(L|n) = 
1. Let M n = {t G M(E, D), |f| = n} and note that #M n = (L|n). We have 

» v St&Cn MO + M„-,C„)M0 

AmS,D = lim- — - 

n n • 

= lim MO + lim Z)te(M„-£„) MO 


n ■ #L r 


n • #M J1 


Pn 


Using the inequality h(t) < |t|, we obtain 


Z)te(M„-£„) MO < n • (#M n - #£ n ) ra ; Q 


n ■ 


n • #M, 


We now consider the terms g n . Given a trace t, we can decompose it as t = 
01 (002(0 with 0i(t) G M(Ei,Di) and 02(0 G M(E 2 >-^ 2 )- Consider a trace 
t such that |t| = n and |t|> n — f(n). We have L(0i(t)) > C' _1 (n — /(n)) 
and /r(02(t)) < /(n), where C is the maximal length of a clique. Using that 
lim n f{n)/n = 0, we obtain that, for n large enough, h{t) = M0i(i)). Hence, 
we have, for n large enough, 


9n= 

i=n-f(n) 


#{\t\=n,\t\- El =i} E|t|=n,|t| Sl =i M0i (0) 


#L n n • #{|t| = n, |t| Sl = *} 

Given G M(Si, Z?i) and n > |u|, we have 

#{f G M(S, L>), |f| = n, 0i(t) = «} = (L 2 |n - |it|) , 


(33) 


which depends on u only via its length. We deduce that 

X)teM(E,D),|t|=n,|t| El =i M0l(O) _ i Z)teM(Ei,Di),|t|=i MO 
n ■ #{i G M(E, D), |t| = n, |t|s! = i} n i ■ #{i G M(Ei, D\), |i| = i} 

~ (i/n)AM(Ei, Di) . 

Replacing in (|33|), we conclude that Am(E,D) = lim„ g n = Am(Ei,Di). □ 
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Proof of Proposition \ 5. j|. 


The notations are the ones of the statement of Proposition 5.3. Assume 


first that M(E, D) is the free commutative monoid over E. According to the 
results of section |6.1|. we have indeed 7 m(E, D) = (|E| + l)/2. 


Assume now that M(E,D) is not the free commutative monoid. Let 
(E 2 , D 2 ) be a maximal connected subgraph of (E, D) and let Si = E—S 2 and 
D\ = D — D 2 . Denote respectively by H, Hi, and H 2 the height generating 
functions of M(S, D), M(Ei, D\), and M(E 2 ,Z? 2 ). We choose (E 2 ,D 2 ) so 
that M(Ei,Di) is different from the free commutative monoid. According 
to Lemma |4.4| . it implies that pn 1 < 1 • We are going to prove the following 
equalities 


7m(E, D ) = 


j 7m(£i, Di) + 7m(S 2 , D 2 ) 

Di) + 1/2 


if | E 2 1 > 1 
if | S 2 1 = 1 


(34) 


Formula (15) follows easily from the above. 


Assume first that |S 2 | > 1. According to Lemma 4.4, it implies that 
Ph 2 < 1- Applying Propositions |4.5| and |4.6| , we have ( Hi\n ) = a n n ki1 ^ 1 pjf^ 
and ( H 2 \n ) = b n pjj n with lim n a n = a and lim n b n = b. Using (Q) and 
performing the same type of computations as in the proof of Proposition 
L6l we get 


(H\n) ~ ab{ 


1 


+ 


1 


- l)n fc «i \p Hl pH 2 )~ 


1 - PH 1 1 - PH 2 


(35) 


We define the maps /, </>i(.) and </> 2 (.) as in the proof of Proposition 5.2 . 
Consider the set 


< H n = {t£ M(E, D), h(t ) = n, h{cfi(t)) >n - f(n), h{f) 2 {f)) >n - f(n)} . 


Using the same type of arguments as in the proof of Proposition |5.2| , it is 
easily seen that lim„ ff}i n /{H\n) = 1. Set n M = {t £ M(E, D), h(t) = n} 
and note that # n M = (H\n). We have 


7m(£,D) 


.. 1*1 + 1*1 

lim-——- - — 

n n• 

lim 1*1 , 1*1 

n n ■ ffH n n ■ # n M 


Using the inequality |t| < Ch(t), where C is the maximal length of a clique, 
we obtain 

1*1 < Cn (# n M - ffHn) n ; 
n • #„M “ n • # n M 
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Using the equality \t\ = |</>i(t)| + |0 2 (f)|, we obtain 


7m(S, D) 


lim I ^1^)1 Ste:H„ l<M*)| 

n n ■ #Jf n n ■ 

7m(Si, D\) + 7m (S 2 , D 2 ) , 


where the last equality is obtained exactly in the same way as in the proof 
of Proposition |5.2| . 

Assume now that |S 2 | = 1. Then we have pn 1 < 1 and pn 2 = 1- It 
implies that (Hi\n) ~ an kli i 1 p H 1 l and (i7 2 |n) = 1. Using @ , we obtain 
that ( H\n) ~ n(Hi\n). Now, by a direct computation, we get, for u 6 
M (Xi,Z?i), 

G M(S, D ), = u, h(t ) = /i(«)} = /i(w) + 1 . 


Dehne the set !K n = {t € M(E, D), h(t) = h(c/)i(t)) = n}. We have = 
(n + l)(Hi\n) ~ (H\n). It implies that 


7m(S, D) 


lim \Mt)\ Etgjo, l<Mt)| 

n n • #d{ n n • #!H n 

( n + I) EteM(£i,Di),h(t)=n |t| EtgM(Si ,D\),h(t)=n Yhi =0 * 

n n ■ n(Hi\n) n • n(Hi\n) 

7m(Si, Di) + 1/2 . 


This completes the proof. 


□ 
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B Trace monoids over 2, 3, and 4 letters 


We give the values of the average heights for all the trace monoids over 
alphabets of cardinality 2, 3, and 4. On the tables below, a trace monoid 
is represented by its (non-directed) dependence graph. For readability, self¬ 
loops have been omitted in the dependence graphs. We have not represented 
the free monoids for which A* = Am = 7m = A c f = 1. 



A* 

Am 

'Am 

A c f 

o o 

1/2 

3/4 

2/3 

1 


I. Trace monoids over 2 letters 


The values in Table I can be obtained using the results in section 6.1 



A* 

Am 

7m 

A c f 

o 

1/3 

11/18 

1/2 

1 

o o 





/ 

2/3 

1 

2/3 

1 

o o 





A 

(10 + >/5)/15 

(7 +a/5)/10 

9/11 

8/9 


II. Trace monoids over 3 letters 

All the values in Table II except one can be obtained using the results 
from the paper. The exception is A* for £ = {a, b, c},I = {(6, c), (c, b)}, 
which is computed in |27]], Example 6.2. 
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Am 

7m 1 

A c f 

1 

o o 

o o 

1/4 

25/48 

2/5 

1 

2 

c 

c 

3 ° 

3 O 

1/2 

1 

1/2 

1 

3 

c 

c 

3 C 

3 C 

3 

3 

1/2 

3/4 

1/2 

1 

4 

c 

c 

3-O 

3 O 

(10 + \/5)/20 

(7 + \/5)/10 

18/31 

52/57 

5 

c 

3--0 

3 O 

3/4 

1 

2/3 

1 

6 

c 

c 

3-C 

3 C 

3 

3 

? 

19/22 

(13 - 2\/l3)/9 

5/6 

7 

o—-c 

o c 

3 

in (||) 

in ®) 

in (38) 

11/14 

8 

c 

3--C 

3 C 

3 

(5 + V2)/8 

(6 + \/2)/8 

in (39) 

7/8 

9 

c 

c 

3-C 

3-C 

3 

3 

(3 + \/3)/6 

(11 + >/2)/14 

(51 + ^l7)/76 

11/14 

10 

c 

3--C 

3—-C 

3 

(9 + V / 3)/12 

(4 + v / 3)/6 

(3a/5 — 5)/2 

8/9 


III. Trace monoids over 4 letters - exact values 
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o o 

o o 


o o 

o o 


A* 

Am 

O'm 

0.25 

0.521-•• 

0.4 

0.5 

1 

0.5 

0.5 

0.75 

0.5 

0.612-•• 

0.923-•• 

0.581--- 

0.75 

1 

0.667-•• 

0.691 • • • 

0.864-•• 

0.643 • • • 

0.681 • • • 

0.873-•• 

0.676 • • • 

0.802 • • • 

0.927-•• 

0.760-•• 

0.789-•• 

0.887-•• 

0.725-•• 

0.894 • • • 

0.955-•• 

0.854-•• 
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Let us denote the dependence graphs in Table III, listed from top to 
bottom, by (X,A),* = 1,..., 10. The graph (X,Dg) is the cocktail party 
graph C -P 2 , hence the values of the average heights can be retrieved from 
section \5.2[ More generally, most of the values in the table can be computed 
using the results from the paper. The exceptions are A* for (X,Z?j),i = 
6,7,8, and 10. For (X,Dg) and (X,Dio), the value of A* can be computed 
by applying Proposition 12 from [j5j. 

For (X, Dq), the exact value of A* is not known. Using truncated Markov 
chains, A. Jean-Marie p3( | obtained the following exact bounds: 

A*(£, A;) <E [0.69125003165,0.69125003169] . 

Let us concentrate on Let (x n )neN* be a sequence of inde¬ 

pendent random variables valued in X and uniformly distributed: P{x n = 
u} = 1/4, u G X. Define X n = ■ ■ -x n ), then ( X n ) n is a Markov chain 

on the state space M(X,Z> 7 ). Let a be the letter such that (a, u) £ D 7 for 
all m£X, Define T = infjn : x n = a}. An elementary argument using the 
Strong Law of Large Numbers then shows that A*(X, D 7 ) = E[Ii(Xt)]/E[T]. 
It follows that 




4 * E 

«GN il+*2+*3=* 


max(«i,i 2 ,* 3 ) 


*l)*2D3y 


(36) 


This expression involves non algebraic generalized hypergeometric series. 
By truncating the infinite sum and upper-bounding the remainder using the 
inequality max(U,* 2 ,* 3 ) < h + ii + 13 , we get the following exact bounds: 


A*(X,X> 7 ) g [0.68111589347,0.68111589349] . 

Another formula for A*(X,Z) 7 ) involving multiple contour integrals and due 
to Alain Jean-Marie is given in [^, Th. 13]. 

The closed form expressions for Am(X, D 7 ) and tm(X, D 7 ) are not given 
in Table III since they are too long and do not fit. We have 

8(—93 - 9\/93 - v/93A + 5X 2 ) 

-1734 - 186\/93 + (141 - 5\/93)X + 67A 2 
(108 + 12\/93) 1/3 , (37) 


10777(529 - 23T 2 + T 4 )(829 + 132^/62 - (139 - 6^62)^ - 11T 2 ) 

3(3779 + 372y / 62)(98340v / 62 — 1461365 - 1529(149 + 66\/62 )Y - 53885T 2 ) 

(38) 


Am(X,D 7 ) = 
X = 

and 7 m(X, D 7 )~ x = 


with Y = (89 + 18\/62) 1/3 . 
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At last, let us comment on the value of 7 m for (£, D$). Using the results 
from section 5.2, we get 


7m(£, £>8) 1 


(1 - 2a) (4- 5a) 

7 - 27a + 24a 2 ’ 


(39) 


where a is the smallest root of the equation 2x 3 — 8x 2 + 6x — 1 = 0. Nu¬ 
merically, we have a = 0.237 • • • and = 0.760 • • • . In this case, Cardan’s 
formulas are of no use (they provide an expression of the real a as a function 
of the cubic root of a complex number). 

Let us conclude by going back to the original motivation of compa¬ 
ring the degree of parallelism in different trace monoids. We claim for 
instance that there is some strong evidence that (X,Dg) is ‘more paral¬ 
lel’ than (S,£)g). Indeed we have A*(X,iAg) < A*(E,Dg), Am(X,£> 9 ) < 
Am(E,-D 8 ), 7 m 1 ( S ’- D 9 ) < 7m 1 ( S ’- D 8), and A c f(S,D 9 ) < A c f(E,D 8 ). 
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